<!DOCTYPE html SYSTEM "about:legacy-compat">
<html manifest="pamflet.manifest">
      <head>
        <title>MongoDB+Hadoop Connector — Frequently Asked Questions</title>
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/blueprint/screen.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/blueprint/grid.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/blueprint/print.css"></link> 
        <!--[if lt IE 8]>
          <link rel="stylesheet" href="css/blueprint/ie.css" type="text/css" media="screen, projection"/>
        <![endif]-->
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/pamflet.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/pamflet-print.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/pamflet-grid.css"></link>
        
        <script src="js/jquery-1.6.2.min.js"></script>
        <script src="js/jquery.collapse.js"></script>
        <script src="js/pamflet.js"></script>
        
        <meta charset="utf-8"></meta>
        <meta name="viewport" content="width=device-width, initial-scale=1"></meta>
      </head>
      <body>
        <a class="page prev nav" href="MongoDB%2BHadoop+Connector.html">
            <span class="space">&nbsp;</span>
            <span class="flip">❧</span>
          </a><a class="page next nav" href="Getting+Started.html">
            <span class="space">&nbsp;</span>
            <span>❧</span>
          </a>
        <div class="container">
          <div class="span-16 prepend-1 append-1">
            <div class="top nav span-16 title">
              <span>MongoDB+Hadoop Connector</span> — Frequently Asked Questions
            </div>
          </div>
          <div class="span-16 prepend-1 append-1 contents">
            <h1 id="Frequently+Asked+Questions">Frequently Asked Questions</h1><h3 id="Do+the+MongoInputFormat%2FMongoOutputFormats+use+HDFS%3F">Do the MongoInputFormat/MongoOutputFormats use HDFS?</h3><p>No. The <code>Mongo*Format</code> classes are designed not to use HDFS; they read and write data directly between MongoDB and Hadoop.
</p><p>A forthcoming release will offer a <code>BSONInputFormat</code> and a <code>BSONOutputFormat</code>, which will allow working offline with MongoDB backup files (in BSON format) stored on HDFS or S3.
</p><h3 id="How+does+the+MongoDB+%2B+Hadoop+Connector+differ+from+Sqoop%3F">How does the MongoDB + Hadoop Connector differ from Sqoop?</h3><p>From the <a href="https://github.com/cloudera/sqoop/wiki">Sqoop Wiki</a>: <em>“Sqoop is a tool designed to import data from relational databases into Hadoop. Sqoop uses JDBC to connect to a database … and automatically generates the necessary classes to import data into the Hadoop Distributed File System (HDFS)”</em>
</p><p>The MongoDB + Hadoop Connector does not go through HDFS; it reads and writes directly between MongoDB and Hadoop for the highest possible performance. Because there is no intermediary export process, Hadoop jobs also get the freshest possible view of their input data.
</p><h4 id="Is+integration+possible+between+MongoDB+and+Sqoop%3F">Is integration possible between MongoDB and Sqoop?</h4><p>Because MongoDB is not a relational database and does not use JDBC for connectivity, integration with Sqoop does not seem feasible at this time.
</p><p>A forthcoming release of the MongoDB + Hadoop Connector will offer a <code>BSONInputFormat</code> and a <code>BSONOutputFormat</code>, which will allow working offline with MongoDB backup files (in BSON format) stored on HDFS or S3, without requiring a live MongoDB database.
</p><div class="tocwrapper show">
      <a class="tochead nav" style="display: none" href="#toc">❦</a>
      <a name="toc"></a>
      <h4 class="toctitle">Contents</h4>
      <div class="tocbody">
      <div><a href="MongoDB%2BHadoop+Connector.html">MongoDB+Hadoop Connector</a></div><ol class="toc"> <li><div class="current">Frequently Asked Questions</div></li><li><div><a href="Getting+Started.html">Getting Started</a></div><ol class="toc"> <li><div><a href="Building+the+Adapter.html">Building the Adapter</a></div></li><li><div><a href="Configuration+%26+Behavior.html">Configuration &amp; Behavior</a></div></li> </ol></li><li><div><a href="Hadoop+Streaming+Support.html">Hadoop Streaming Support</a></div><ol class="toc"> <li><div><a href="Building+Hadoop+Streaming+Support.html">Building Hadoop Streaming Support</a></div></li> </ol></li><li class="generated"><div><a href="Contents+in+Depth.html">Contents in Depth</a></div></li><li class="generated"><div><a href="Combined+Pages.html">Combined Pages</a></div></li> </ol></div></div>
          </div>
        </div>
        <a class="fork nav" href="http://github.com/mongodb/mongo-hadoop"><img alt="Fork me on GitHub" src="img/fork.png"></img></a>
      </body>
    </html>